General Disclaimer 


One or more of the Following Statements may affect this Document 


• This document has been reproduced from the best copy furnished by the 
organizational source. It is being released in the interest of making available as 
much information as possible. 


• This document may contain data, which exceeds the sheet parameters. It was 
furnished in this condition by the organizational source and is the best copy 
available. 


• This document may contain tone-on-tone or color graphs, charts and/or pictures, 
which have been reproduced in black and white. 


• This document is paginated as submitted by the original source. 


• Portions of this document are not fully legible due to the historical nature of some 
of the material. However, it is the best reproduction available from the original 
submission. 


Produced by the NASA Center for Aerospace Information (CASI) 











(E79-10022) STATISTICAL SEPARABILITY AND
CLASSIFICATION OF LAND USE CLASSES USING
IMAGE-100 (Instituto de Pesquisas Espaciais,
Sao Jose) 17 p HC A02/MF A01 CSCL 05B

G3/43    N79-12533    Unclas    00022


CONSELHO NACIONAL DE DESENVOLVIMENTO CIENTÍFICO E TECNOLÓGICO

INSTITUTO DE PESQUISAS ESPACIAIS







STATISTICAL SEPARABILITY AND CLASSIFICATION OF
LAND USE CLASSES USING IMAGE-100

Conselho Nacional de Desenvolvimento Científico e Tecnológico
12.200 - S. J. dos Campos, S.P., Brasil


Abstract


The purpose of this study was to compare the classification
accuracy of land use classes of S.J. dos Campos, S.P., Brasil,
using the different options of classification available in the
Image-100 of General Electric Co. and the sample classifier. In
addition, the statistical separability of land use classes in the
subsets of one to four spectral channels was investigated. With
the help of ground observations and aerial photography, the
multispectral scanner (MSS) data of LANDSAT were analysed using
the Image-100. In the subsets of one to three spectral channels
(ch. 4 = 0.5 to 0.6 µm, ch. 5 = 0.6 to 0.7 µm, ch. 6 = 0.7 to
0.8 µm, ch. 7 = 0.8 to 1.1 µm), channel 4, channels 4 & 7, and
channels 4, 5 & 7 were found to be the best choices. For the
single-cell option of the Image-100, the errors of omission varied
from 5% for the class industrial to 46% for the class
institutional. The errors of commission varied from 11% for the
class commercial to 39% for the class industrial. As expected, the
multi-cell option increased the errors of omission and decreased
the errors of commission. However, considering both the errors of
omission and commission, this option considerably decreased the
percentage of correct classification compared with the single-cell
option. On the whole, the sample classifier gave considerably more
accurate results than the single-cell or multi-cell option.

Introduction

The purpose of this study was to compare the classification
accuracy of land use classes using the different options of
classification available in the Image-100 (Image-100 is a data
processing system marketed by General Electric Co. that extracts
thematic information from multispectral imagery, enhances the
image, etc.). In addition, the statistical separability of land
use classes in the subsets of one to four spectral channels was
investigated.


Cloud-free multispectral scanner data from LANDSAT, of reasonable
quality, over Sao Jose dos Campos (23° 10' S, 45° 30' W), Sao
Paulo, Brasil, acquired on September 8, 1972, were available. In
addition, aerial photography and ground observations were
available to assist the analysis of the data. Sao Jose dos Campos
was selected because it is one of the most industrialized and
fastest growing small-size towns of Brasil and the authors are
well familiar with it. Many of the problems of this town are
similar to the problems of much larger urban centers.


With the help of ground observations and aerial photography, a map
of Sao Jose dos Campos showing the following land use classes was
obtained: residential areas, multi-family residential areas,
commercial areas, industrial areas, institutional areas,
agricultural areas and unoccupied areas.


The specific objectives of the study are stated as follows:


1. To determine what combinations of one through three spectral
channels out of four available channels give the greatest overall
statistical separability of the above seven land use classes.

2. To compare the classification accuracy of land use classes
using the 'single-cell signature acquisition' and 'multi-cell
signature acquisition' options of classification available in the
Image-100 and a sample classifier on-line-mode in the Image-100.

Literature Review


Many investigators have analysed the multispectral scanner (MSS)
data of the LANDSAT satellite for applications to land use
classification. For example, Todd and Baumgardner (1973) analysed
LANDSAT MSS data, obtained over Marion County (Indianapolis),
Indiana, by computer-implemented techniques, to

evaluate the utility of satellite data for urban land use
classification. Several land use classes, such as
commerce/industry, single-family (newer) residential, trees, and
water exhibited spectrally separable characteristics and were
identified with greater than 90 percent accuracy. Ellefsen et al.
(1973) did computer-aided analysis of LANDSAT MSS data of the San
Francisco Bay area. Smith et al. (1974) have given the application
of spatial features to satellite land-use analysis. Ellefsen et
al. (1974) have given new techniques in mapping urban land use and
monitoring change for selected U.S. metropolitan areas. They
analysed LANDSAT MSS data using automatic pattern recognition
techniques for classification. Kumar and Silva (1977) have
analysed the statistical separability of agricultural cover types
in much detail, data quantity and depth in the subsets of one to
twelve spectral channels.

Economy et al. (1974) have given the classification accuracy of
the Image-100 using the single-cell option (described later in the
report). They reported an accuracy of 70 ± 12%. Factors which
limited the classification accuracy were the spatial resolution of
the ERTS-1 (now called LANDSAT) scanner, the land-use
classification system, and inaccuracies in the ground truth.
Goldberg et al. (1975) have described methods and procedures which
outside investigators may use with the automated processing
equipment of the Canada Centre for Remote Sensing (CCRS) for the
purpose of natural resource exploration and mapping. They have
compared the accuracies of unsupervised and supervised methods on
the basis of the confusion matrices generated by classifying
exactly the same area.


Method of Analysis


Multispectral scanner data from computer compatible tapes of
LANDSAT were analysed using the Image-100. With the aid of the
land use map of Sao Jose dos Campos mentioned above, rectangular
areas of each of the above seven land use classes were selected on
the display of the Image-100, avoiding the boundaries of classes.
The areas of each of these classes were selected carefully so that
they could be considered representative of the respective land use
classes. Assuming that each of these classes has a multivariate
gaussian distribution, the B-distance, based on the Bhattacharyya
coefficient, was calculated between all possible pairs of these
classes in all possible combinations of one, two, three and four
spectral channels using the feature selection algorithm of the
Brazilian Institute of Space Research (INPE) on-line-mode with the
Image-100. For each value of B-distance, the probability of
correct classification was reasonably estimated from the curve of
Swain
and King (1973). The B-distance for two probability densities
$p_1(x)$ and $p_2(x)$ is given by

$$B = \int \left[ \sqrt{p_1(x)} - \sqrt{p_2(x)} \right]^2 dx$$

If $p_1(x)$ and $p_2(x)$ are multivariate gaussian distributions
as above, then

$$B = 2\left(1 - e^{-\alpha}\right),$$

where

$$\alpha = \frac{1}{8}(U_1 - U_2)^T \left[\frac{\Sigma_1 + \Sigma_2}{2}\right]^{-1} (U_1 - U_2) + \frac{1}{2}\ln\frac{\left|\frac{\Sigma_1 + \Sigma_2}{2}\right|}{\sqrt{|\Sigma_1|\,|\Sigma_2|}} \qquad (4)$$

where $U_1$ and $U_2$ are the mean vectors of classes one and two
respectively, $\Sigma_1$ and $\Sigma_2$ are the covariance
matrices of classes one and two respectively, and $T$ denotes
transpose.
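
The gaussian case above can be illustrated with a short Python
sketch. It is not the INPE feature selection algorithm itself. It
assumes the standard Bhattacharyya form for the exponent (a
quadratic term in the mean difference plus a logarithmic term in
the covariance determinants), and the function names and the small
elimination helper are hypothetical, introduced only for this
example.

```python
import math


def _det_and_solve(a, b):
    """Return (det(a), x) with a @ x = b, by Gaussian elimination
    with partial pivoting. Hypothetical helper for small matrices."""
    n = len(a)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented copy
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular matrix")
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = s / m[i][i]
    return det, x


def _det(a):
    return _det_and_solve(a, [0.0] * len(a))[0]


def b_distance(mean1, cov1, mean2, cov2):
    """B = 2 (1 - exp(-alpha)) for two gaussian classes (assumed form)."""
    n = len(mean1)
    avg = [[(cov1[i][j] + cov2[i][j]) / 2.0 for j in range(n)]
           for i in range(n)]
    diff = [mean1[i] - mean2[i] for i in range(n)]
    det_avg, y = _det_and_solve(avg, diff)  # y = avg^-1 (U1 - U2)
    quad = sum(diff[i] * y[i] for i in range(n))
    alpha = quad / 8.0 + 0.5 * math.log(
        det_avg / math.sqrt(_det(cov1) * _det(cov2)))
    return 2.0 * (1.0 - math.exp(-alpha))
```

Identical classes give B = 0, and B approaches 2 as the classes
become fully separated, which is consistent with the range of the
values reported in Tables 2 to 5.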


B-distance is defined for two distributions. Remote sensing
usually involves more than two classes. One strategy is to compute
the average B-distance over all pairs of classes and select the
subset of features for which the average B-distance is maximum,
i.e., maximise, with respect to all n-tuples
$(c_1, \ldots, c_n)$,

$$B_{AVE} = \frac{2}{m(m-1)} \sum_{i=1}^{m-1} \sum_{j=i+1}^{m} B(i, j \mid c_1, \ldots, c_n) \qquad (5)$$
where


$m$ = number of classes,


$B(i, j \mid c_1, \ldots, c_n)$ = B-distance between classes $i$
and $j$ in the channels $c_1, \ldots, c_n$.


$B_{AVE}$ was computed for all possible subsets of one, two, three
and four spectral channels out of the available four channels.
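
The subset search described above can be sketched in Python as
follows. This is an illustration only, not the on-line INPE
routine. For brevity it assumes diagonal covariance matrices
(channels treated as uncorrelated), so the exponent reduces to a
sum of per-channel terms. The function names and the layout of
`class_stats` (class name, then channel number, then a
`(mean, variance)` pair) are hypothetical.

```python
import math
from itertools import combinations


def b_distance_diag(stats1, stats2, channels):
    """B-distance between two classes over the given channels,
    assuming uncorrelated channels (a simplification)."""
    alpha = 0.0
    for c in channels:
        (m1, v1), (m2, v2) = stats1[c], stats2[c]
        v = (v1 + v2) / 2.0
        alpha += (m1 - m2) ** 2 / (8.0 * v)
        alpha += 0.5 * math.log(v / math.sqrt(v1 * v2))
    return 2.0 * (1.0 - math.exp(-alpha))


def average_b(class_stats, channels):
    """Average B-distance over all unordered pairs of classes."""
    pairs = list(combinations(class_stats, 2))
    total = sum(
        b_distance_diag(class_stats[a], class_stats[b], channels)
        for a, b in pairs)
    return total / len(pairs)


def best_subset(class_stats, all_channels, n):
    """Channel n-tuple with the largest average B-distance."""
    return max(combinations(all_channels, n),
               key=lambda chans: average_b(class_stats, chans))
```

With four channels there are only 4, 6, 4 and 1 subsets of size
one to four, so an exhaustive search of this kind is inexpensive.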

Each of these land use classes was divided into two independent
sets: training fields and test fields. Using the training fields
of residential areas, the test fields of each of the above seven
classes and the training fields of each of the classes except
residential areas were classified, using the single-cell signature
acquisition option of the Image-100. This option creates a
four-dimensional rectangular parallelepiped, each side of which
corresponds to the signature limits of the training areas in one
channel. The number of pixels classified as residential areas by
the computer inside the test fields of each of these seven classes
was determined. An identical analysis was repeated for each of the
other six land use classes. Thus, a confusion matrix showing the
total number of pixels (picture elements) of each class classified
correctly, as well as classified incorrectly into each of the
other classes, was obtained.
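
The single-cell procedure can be pictured with the following
Python sketch. It is a simplified illustration and not the
Image-100 implementation: it takes the signature limits to be the
per-channel minimum and maximum of the training pixels (an
assumption), and it counts only test-field pixels, whereas the
study also classified the training fields of the other classes.
All names are hypothetical.

```python
def signature_limits(training_pixels):
    """Per-channel (low, high) limits of the training pixels,
    i.e. the sides of the rectangular parallelepiped."""
    return [(min(ch), max(ch)) for ch in zip(*training_pixels)]


def in_parallelepiped(pixel, limits):
    """True if the pixel lies inside the limits in every channel."""
    return all(lo <= v <= hi for v, (lo, hi) in zip(pixel, limits))


def confusion_counts(training, test):
    """counts[s][c] = number of test pixels of true class c that
    fall inside the parallelepiped trained on class s."""
    limits = {name: signature_limits(px) for name, px in training.items()}
    return {
        sig: {cls: sum(in_parallelepiped(p, limits[sig]) for p in pixels)
              for cls, pixels in test.items()}
        for sig in limits
    }
```

Because the parallelepipeds of different classes may overlap, one
pixel can be accepted by more than one signature, so the rows of
this table need not sum to the number of test pixels.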

This whole procedure was repeated for the multi-cell signature
acquisition option of the Image-100. In the multi-cell signature
acquisition, the parallelepiped of the spectral signature is
subdivided into cells, each of unit volume, and the number of
pixels in each of these unit cells is counted. These cell counts
are thus measures of the probability distribution of the spectral
cluster. By raising or lowering the threshold on the cell counts,
one can vary the size and four-dimensional probability
distribution of the spectral cluster, by deleting or adding cells
with counts greater than a variable threshold. In the interactive
signature modification option, the user performs training on the
misclassified area, adding the errors of omission and subtracting
the errors of commission until satisfied with the results. The
preliminary results showed that this option did not seem to
improve much the classification accuracy of the 'single-cell'
option, because the basic problem was the overlap between the
classes in the four-dimensional spectral space.
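
The cell-count thresholding of the multi-cell option can be
sketched as below. This is an illustrative Python sketch rather
than the Image-100 implementation. It assumes integer digital
counts, so that each distinct four-channel value is one unit cell,
and it keeps the cells whose training count is at least the
threshold m. The function names are hypothetical.

```python
from collections import Counter


def multicell_signature(training_pixels, m):
    """Set of unit cells holding at least m training pixels."""
    counts = Counter(tuple(p) for p in training_pixels)
    return {cell for cell, n in counts.items() if n >= m}


def classify_multicell(pixel, signature):
    """A pixel is accepted only if its unit cell was kept."""
    return tuple(pixel) in signature
```

Raising m removes sparsely populated cells from the signature, so
fewer pixels are accepted. This is the mechanism by which the
errors of omission rise and the errors of commission fall as m
increases.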

The same training fields of each class were used to classify the
test fields using the sample classifier based on B-distance. The
B-distance was computed between a test field and each of the seven
training classes, and the test field was classified into the class
for which the B-distance was minimum.
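
The minimum-distance rule of the sample classifier can be sketched
in Python as follows. This is an illustration under stated
assumptions, not the classifier used in the study: each field is
summarised by per-channel means and sample variances only
(diagonal covariance), and the names are hypothetical. Every
channel of every field must have non-zero variance for the
logarithm to be defined.

```python
import math
from statistics import fmean, variance


def field_stats(pixels):
    """Per-channel (mean, sample variance) of a field of pixels."""
    return [(fmean(ch), variance(ch)) for ch in zip(*pixels)]


def b_distance_diag(s1, s2):
    """B-distance between two fields, assuming uncorrelated channels."""
    alpha = 0.0
    for (m1, v1), (m2, v2) in zip(s1, s2):
        v = (v1 + v2) / 2.0
        alpha += (m1 - m2) ** 2 / (8.0 * v)
        alpha += 0.5 * math.log(v / math.sqrt(v1 * v2))
    return 2.0 * (1.0 - math.exp(-alpha))


def classify_field(test_pixels, training_fields):
    """Name of the training class with the smallest B-distance
    to the test field taken as a whole."""
    test = field_stats(test_pixels)
    return min(
        training_fields,
        key=lambda name: b_distance_diag(
            test, field_stats(training_fields[name])))
```

The decision is made once per field rather than once per pixel,
which is what distinguishes a sample classifier from the
pixel-by-pixel single-cell and multi-cell options.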
Results and Conclusions


Table 1 shows the distribution of pixels of the training and the
test fields of each class. Tables 2 to 5 give the values of
$B_{AVE}$ in all possible combinations of one, two, three and four
channels respectively out of four available channels. As one would
expect, the probability of correct classification increases with
an increase in the number of channels. In the subsets of one to
three spectral channels, channel 4, channels 4 & 7 (one in the
visible and one in the near infrared), and channels 4, 5 & 7 (two
in the visible and one in the near infrared) are found to be the
best choices. Table 3 shows that in the subset of two channels,
channels 4 and 5 (visible wavelength region) give a higher
probability of correct classification than channels 6 & 7 (near
infrared wavelength region). The authors believe that each
wavelength region (visible, near infrared, middle infrared and
thermal infrared) has valuable spectral information. Thus, in the
subset of two spectral channels, one channel in the visible and
one channel in the near infrared wavelength region are found to be
the best choice. Kumar (1977) has analysed aircraft MSS data in
much detail, data quantity and depth in the subsets of one to
twelve spectral channels, to evaluate each spectral channel as
well as possible combinations of wavelength regions, for
statistical separability of agricultural cover types.

Using the training fields of a class, the number of pixels
classified by the Image-100 into each of the seven classes
(training + test fields, except for the class being used to train
the computer, in which case only test fields) are given in
Table 6. From Table 6, the errors of omission (while using
training fields of residential, the number of pixels of test
fields of residential not classified as residential constitute the
errors of omission, etc.) and the errors of commission (while
using training fields of residential, the number of pixels of
classes other than residential which are actually classified as
residential) were calculated and are shown in Table 7. Similarly,
the errors of omission and commission using the multi-cell
signature acquisition (m = 1, m = 2 and m = 3) for the same
training and test fields of each class were calculated and are
given in Table 7. The option m = 1 means that all the unit cells
in the four-dimensional spectral space which had less than one
pixel were deleted from the spectral signature of the training
fields for doing classification. Similarly, the option m = 2 means
that all the unit cells in the four-dimensional spectral space
which had less than two pixels were deleted from the spectral
signature of the training fields for doing classification, etc.
Table 7 shows that for the single-cell option, the errors of
omission vary from 5% for the class



Table 2. Values of B_AVE in subset of one channel

Channel     B_AVE     Estimated percentage probability of correct
                      classification (Swain and King, 1973)

4           0.7787    82.4
5           0.7250    81.6
6           0.4267    72.0
7           0.3510    69.6


Table 3. Values of B_AVE in subset of two channels

Channels    B_AVE     Estimated percentage probability of correct
                      classification (Swain and King, 1973)

4-5         0.8877    84.8
4-6         0.8959    85.6
4-7         0.9599    86.4
5-6         0.8436    83.2
5-7         0.8775    84.8
6-7         0.6075    77.6


Table 4. Values of B_AVE in subset of three channels

Channels    B_AVE     Estimated percentage probability of correct
                      classification (Swain and King, 1973)

4-5-6       0.9920    87.2
4-5-7       1.0449    88.0
4-6-7       1.0186    87.2
5-6-7       0.9504    85.7


Table 5. Values of B_AVE in the available four channels

Channels    B_AVE     Estimated percentage probability of correct
                      classification (Swain and King, 1973)

4-5-6-7     1.0940    (illegible in source)


Note: B_AVE is defined in eq. (5)





























Table 7. Percentage errors of omission and commission
in 'single-cell' and 'multi-cell' signature
acquisition

(A) Percentage errors of omission

Class                      Single-cell   Multi-cell   Multi-cell   Multi-cell
                                         m = 1        m = 2        m = 3

Residential                20.8          63.21        73.91        83.61
Multi-family residential   33.3          80.21        88.54        94.27
Commercial                 16.3          74.71        78.49        83.43
Industrial                  5.0          68.44        80.00        90.00
Institutional              46.3          59.32        82.56        87.50
Agricultural               19.4          73.46        80.26        86.69
Unoccupied                 16.9          77.46        82.30        85.40


(B) Percentage errors of commission

Class                      Single-cell   Multi-cell   Multi-cell   Multi-cell
                                         m = 1        m = 2        m = 3

Residential                25.28          5.32         3.38         2.43
Multi-family residential   35.64          0.92         0.27         0.01
Commercial                 10.99          1.75         0.81         0.41
Industrial                 39.00         11.08         1.83         1.59
Institutional              38.00         26.65         4.48         2.57
Agricultural               21.41          5.20         4.02         3.20
Unoccupied                 25.99          2.97         1.58         1.23


Note: The option m = 1 means that all the unit cells in the
four-dimensional spectral space which had less than one pixel were
deleted from the spectral signature of the training fields for
doing classification.
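
The two error measures of Table 7 can be computed from pixel
counts along the following lines. This Python sketch is
illustrative only. The text defines which pixels count as omitted
or committed but does not state the denominator used for the
commission percentage, so the sketch assumes it is the total
number of pixels belonging to the other classes. The function and
argument names are hypothetical.

```python
def omission_commission(counts, totals, cls):
    """Percentage errors for the signature trained on `cls`.

    counts[c]: pixels of true class c accepted by the `cls` signature
    totals[c]: pixels of true class c that were examined
    """
    omitted = totals[cls] - counts[cls]
    others_in = sum(n for c, n in counts.items() if c != cls)
    # Assumed denominator: all examined pixels of the other classes.
    others_total = sum(n for c, n in totals.items() if c != cls)
    omission = 100.0 * omitted / totals[cls]
    commission = 100.0 * others_in / others_total
    return omission, commission
```

A signature that accepts fewer pixels overall lowers `others_in`
but raises `omitted`, which is the trade-off visible between the
single-cell and multi-cell columns of the table.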




... industrial to 46% for the class institutional. The errors of
commission vary from 11% for the class commercial to 39% for
the class industrial. It shows that classification accuracy for
all the classes is rather poor except for the class commercial,
where the percentage errors are reasonably small (errors of
omission = 16.3%, commission = 11%). This is because of the small
values of standard deviation (and hence less overlap) of this
class in each of the spectral channels, especially in channels
one (0.5 to 0.6 µm) and four (0.8 to 1.1 µm). In general,
an increase in the standard deviations of a class in the
spectral channels tends to reduce the errors of omission and
increase the errors of commission. It was found that, taking into
account both the errors of omission and the errors of
commission, the classification accuracy generally decreases with
an increase in the standard deviations. The sum of standard
deviations for the class industrial in the four spectral
channels was much higher than the sum of standard
deviations for any of the other classes. The intervals of its
spectral response in each of the four spectral channels were
very large and that is why it has low errors of omission. The
errors of omission as well as commission for the class
institutional are very large. An institution can have office
buildings, parking areas, recreation areas, housing for employees,
workshops, trees, etc. Thus, it would require a large number
of training areas for them to be truly representative of the
class institutional. The authors believe that the training
areas of the class institutional were perhaps not truly
representative of the class and hence the errors
of omission as well as commission were found to be rather
large.
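
The link between the width of the spectral intervals and the two kinds of error can be illustrated with a toy one-channel example. The sketch below assumes that a single-cell signature behaves like a box test with lower and upper limits in each channel. The limits, the helper names (in_box, error_rates) and the pixel values are hypothetical and serve only to show the trade-off.

```python
def in_box(pixel, limits):
    """True if every channel value lies within that channel's (low, high) limits."""
    return all(lo <= v <= hi for v, (lo, hi) in zip(pixel, limits))

def error_rates(own_pixels, other_pixels, limits):
    """Percentage errors of omission and commission for one class signature."""
    missed = sum(not in_box(p, limits) for p in own_pixels)
    wrongly_taken = sum(in_box(p, limits) for p in other_pixels)
    return (100.0 * missed / len(own_pixels),
            100.0 * wrongly_taken / len(other_pixels))

# Hypothetical one-channel gray levels (not data from the report).
own = [(18,), (20,), (22,), (27,)]      # pixels that belong to the class
others = [(12,), (16,), (25,), (30,)]   # pixels of all the other classes

narrow = error_rates(own, others, [(18, 22)])   # small spread of the class
wide = error_rates(own, others, [(15, 28)])     # large spread of the class
print("narrow interval (omission, commission):", narrow)
print("wide interval   (omission, commission):", wide)
```

With the wide interval no pixel of the class is missed, but half of the pixels of the other classes fall inside the limits, which is the behaviour reported above for the class industrial.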

The first element of the last row of Table 6 shows the
total number of pixels that have been incorrectly classified
as class residential divided by the total number of training
and test fields of all the classes except the class
residential. In this calculation, it should be noted that many
pixels may have been classified incorrectly more than once as
class residential. Similarly, the other elements of the
last row are self-explanatory.
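
The two error measures can be written out as a short calculation. The sketch below is one possible reading, and it assumes that the denominators are pixel counts of the training and test fields. The function name omission_commission and all the counts are hypothetical.

```python
def omission_commission(counts, totals):
    """Percentage errors of omission and commission per class.

    counts[i][j]: pixels of true class i that were assigned to class j.
    totals[i]:    pixels of true class i in the training and test fields.
    A pixel may be assigned to several classes, so a row of counts
    does not have to add up to totals[i].
    """
    n = len(totals)
    omission = [100.0 * (totals[i] - counts[i][i]) / totals[i] for i in range(n)]
    commission = []
    for j in range(n):
        wrong = sum(counts[i][j] for i in range(n) if i != j)
        others = sum(totals[i] for i in range(n) if i != j)
        commission.append(100.0 * wrong / others)
    return omission, commission

# Hypothetical counts for three classes (not data from the report).
totals = [100, 50, 50]
counts = [
    [80, 10, 5],
    [20, 40, 0],
    [5, 10, 45],
]
omission, commission = omission_commission(counts, totals)
print(omission)
print(commission)
```

For the first class, 25 pixels of the other classes are assigned to it out of the 100 pixels that belong to those classes, which gives an error of commission of 25%.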

Table 7 shows, as expected, that the multicell option
increases the errors of omission and decreases the errors of
commission. Considering the errors of omission as well as the
errors of commission, the multicell option for m = 1 considerably
decreases the percentage of correct classification for each of
the classes. This is because the number of pixels used for
training of each class was relatively small for statistical
purposes. Thus, the unit cells in the four dimensional spectral
space were sparsely populated. Hence, there may be many cells
which are actually representative of the class but do not have
any pixel because the total number of pixels for training for
each of the classes was rather small. Similarly, for the
multicell option, the errors of omission increase and the
errors of commission decrease slowly as we go from m = 1 to
m = 2 to m = 3. Considering the errors of omission as well as
the errors of commission, the percentage of correct
classification decreases slowly as we go from m = 1 to m = 2 to m = 3.
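
The trend described for m = 1, 2 and 3 can be checked directly against the multi-cell columns of Table 7. The sketch below simply transcribes those columns. The helper names are hypothetical, and the values are as read from the scanned table, so individual digits may be affected by the print quality.

```python
# Multi-cell columns of Table 7 as (m = 1, m = 2, m = 3), in percent.
omission = {
    "Residential": (63.21, 73.91, 83.61),
    "Multi-family Residential": (80.21, 88.54, 94.27),
    "Commercial": (74.71, 78.49, 83.43),
    "Industrial": (68.44, 80.00, 90.00),
    "Institutional": (59.32, 82.56, 87.50),
    "Agricultural": (73.46, 80.26, 86.69),
    "Unoccupied": (77.46, 82.30, 85.40),
}
commission = {
    "Residential": (5.32, 3.38, 2.43),
    "Multi-family Residential": (0.92, 0.27, 0.01),
    "Commercial": (1.75, 0.81, 0.41),
    "Industrial": (11.08, 1.83, 1.59),
    "Institutional": (26.65, 4.48, 2.57),
    "Agricultural": (5.20, 4.02, 3.20),
    "Unoccupied": (2.97, 1.58, 1.23),
}

def is_increasing(values):
    """True if each value is strictly larger than the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))

def is_decreasing(values):
    """True if each value is strictly smaller than the one before it."""
    return all(a > b for a, b in zip(values, values[1:]))

omission_rises = all(is_increasing(v) for v in omission.values())
commission_falls = all(is_decreasing(v) for v in commission.values())
print("omission rises with m for every class:", omission_rises)
print("commission falls with m for every class:", commission_falls)
```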

Table 8 shows the results of classification using a sample
classifier. As pointed out earlier, the same training
and test fields were used in this case as in the "single cell"
or "multicell" option of the Image-100. Unfortunately, one
field of the class institutional could not be classified
using the sample classifier due to a temporary malfunction of
the Image-100 at that time. Comparing these results to
the single-cell option, we find that the errors of commission
are reduced significantly, whereas the errors of omission are
increased for some classes and decreased for the others. The
errors of omission for the class industrial are very large
because the standard deviation of the class industrial was
extremely large in each of the four spectral channels. For
most classes, the larger the standard deviations, the lower
the percentage of correct classification using the sample
classifier. Comparing the sample classifier to the 'multicell
option', we find that it gives much smaller errors of omission,
whereas it gives greater errors of commission for some classes
and smaller for the others. On the whole, the sample
classifier gives a percentage of correct classification much
better than the single-cell or multi-cell options of the
Image-100. Similar analysis will be done using the 'pixel by pixel
maximum likelihood gaussian classifier'. In addition, computer
compatible tapes of S.J. dos Campos of other times will
be analysed to investigate the effect of time on these results.

The authors gratefully acknowledge the assistance of
Dr. Nelson de Jesus Parada, Director of the Instituto de
Pesquisas Espaciais (INPE); Dr. Celso de Renna e Souza, Mr. J.C.
Moreira and Mrs. M.S.S. Barros of INPE with this work.







Table 8. Results of classification using sample classifier